Extraction of Topical Consumer Products from Weblogs

نویسندگان

  • Shinichi Nagano
  • Masumi Inaba
  • Yumiko Mizoguchi
  • Takahiro Kawamura
چکیده

This paper proposes a new algorithm of associated topic extraction, which detects related topics in a collection of blog entries commenting on a specified topic. The main feature of the algorithm is to evaluate how important a topic is to the collection, according to the popularity of blog entries through Trackbacks and comments. Another feature is to utilize product ontology for excluding unrelated topics. Evaluation results show that the proposed algorithm can capture users’ impressions of associated topics more accurately than TF-IDF. Introduction Consumer Generated Media (CGM) has been one of the major “word-of-mouth” media, and has a significant impact not just on customers’ purchase decision processes but also on companies’ product marketing strategies. Analysis of CGM posed a challenge in product marketing(Facca & Lanzi 2005; Mishne & de Rijke 2006).Sentiment analysis is a typical CGM analysis, which extracts evaluation expressions from CGM contents commenting on a specified product to summarize its reputation. Its technology is still in the early phase of application to practical product marketing. We have developed a reputation analysis system(Kawamura et al. 2007), which retrieves blog entries commenting on a specified product, and then extracts reputation expressions and related products from the blog entries. The system indicates the overall rating of the product reputation (positive vs. negative), and suggests related products that are much discussed in the blog entries. This paper proposes a new algorithm of associated topic extraction for suggestion of related products. The main feature is the evaluation of how important a topic is to a collection of blog entries, according to the popularity of the blog entries, through Trackbacks and comments. Another feature is the utilization of product ontology for topic filtering to extract products relevant to or similar to the given product. The paper also briefly evaluates the proposed algorithm in comparison with TF-IDF, a widely-used topic detection algorithm. Copyright c ⃝ 2008, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. Associated Topic Extraction The purpose of associated topic extraction is to suggest similar or comparable products that are much discussed in retrieved blog entries commenting on a specified product. The straightforward approach to the extraction is to count frequency of occurrence of proper nouns in blog entries through morphological analysis. Morphological analysis segments and tokenizes text strings into a series of meaningful terms, and then adds the terms to complemental information such as part of speech, pronunciation, semantic information. In the straightforward approach, any blog entry and any noun string appearing in blog entries are uniformly enumerated. Although it may appear to be a plausible approach such a uniform approach does not accord with blog readers’ impression. This paper proposes a new algorithm for associated topic extraction, abbreviated to AT. The main feature is that AT finds a blog thread, which is a set of blog entries spanned by Trackbacks, and then biases the frequency occurrence of proper nouns according to the thread size. This is derived from the following belief. Suppose that blog entries conversing about a target product receive many Trackback pings. Then, the entries may be especially interesting to many bloggers, and products mentioned in the entries may be similar to the target product or competitive with it. The degree of association AT (t) for each extracted topic t is formulated as follows. Let TF (t) and IF (t) be the topic occurrence frequency t and the inverse document frequency for topic t. Given a set of blog entries conversing about a certain product, then AT (t) is defined as follows: AT (t) = (TFR(t) + TFIR(t))× IF (t) (1) Note that TFR(t) and TFIR(t) weight a topicality of topic t by correlation between blog entries. Another feature of AT is to utilize product ontology. Product ontology is composed of an is-a relation representing hierarchical categorization of products and an instance-of relation representing mappings of individual products with product categories. Referring to product ontology, AT finds only similar or competitive products mentioned in the blog threads conversing about a target product to exclude out proper nouns other than product names. We have developed product ontology including more than 400 thousand consumer products including cars, DVDs, and home appli-

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mobile Service for Reputation Extraction from Weblogs - Public Experiment and Evaluation

In this paper, we introduce a mobile service that extracts reputations of a product from weblogs by cellular phones during shopping. If the user takes a photo of a product barcode on the package with a cellular phone camera, Ubiquitous Metadata Scouter first gets the product metadata (name, manufacturer, etc.) from the internet and collects blogs that review the product. Also, it analyzes the b...

متن کامل

Preparation and physicochemical evaluation of topical formulations of purified curcuminoids from Curcuma longa rhizome

Background and objectives: The purpose of this study was optimization of semisolid topical formulation from ethanol extract of turmeric and evaluation of rheological characterization and investigation of physicochemical control tests. Methods: The ethanolic extract was prepared with Soxhlet method and the compounds were isolated with silica gel column chromatog...

متن کامل

Preparation of a topical herbal formulation for hyperpigmentation

Background and objectives: Skin hyperpigmentation usually indicates an increase in production and accumulation of melanin. Hyperpigmentary disorders are characterized by the appearance of melasma, freckles and lentigo on the skin surface which can cause psychological disturbances. Herbal products could be extensively preferable due to their widespread accessibility and the vast...

متن کامل

Presenting a local marketing model with emphasis on consumers' attitudes in order to buy Iranian products with a customer information behavior approach

Background and Aim: Considering the importance of customer behavior, the purpose of this study was to present a local marketing model with emphasis on consumerschr('39') attitudes to purchase Iranian products with a customer information behavior approach.  Research Method: The method of this research was data theory and data collection was done through semi-structured interviews. Also, the sta...

متن کامل

The Purchase of Foreign Products: The Role of Brand Image, Ethnocentrism and Animosity: Iran Market Evidence

Consumer purchasing decisions relating to foreign products are influenced by economic, psychological and sociological factors. This study attempts to relate such rarely studied variables as animosity and ethnocentrism to concepts taken from consumer behavior which are brand image and purchase intention. Consequently, this study attempts to provide insights into how brand image, consumer ethnoce...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008